A comparative assessment and analysis of 20 representative sequence alignment methods for protein structure prediction
Abstract
Protein sequence alignment is essential for template-based protein structure prediction and function annotation. We collect 20 sequence alignment algorithms, 10 published and 10 newly developed, which cover all representative sequence- and profile-based alignment approaches. These algorithms are benchmarked on 538 non-redundant proteins for protein fold recognition against a uniform template library. The results demonstrate a dominant advantage of profile-profile methods, which generate models with an average TM-score 26.5% higher than sequence-profile methods and 49.8% higher than sequence-sequence alignment methods. There is no obvious difference between methods whose profiles are generated from PSI-BLAST position-specific scoring matrices (PSSMs) and those generated from hidden Markov models. The accuracy of profile-profile alignments can be further improved by 9.6% or 21.4% when predicted or native structure features, respectively, are incorporated. Nevertheless, TM-scores from profile-profile methods that include experimental structural features are still 37.1% lower than those from TM-align, demonstrating that the fold-recognition problem cannot be solved solely by improving the accuracy of structure feature predictions.
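The abstract compares methods by average TM-score but does not define the measure. As a rough illustration, the sketch below uses the standard TM-score formula, (1/L) * sum over aligned residues of 1 / (1 + (d_i/d0)^2) with d0 = 1.24 * (L - 15)^(1/3) - 1.8, for one fixed superposition. The function names, the 0.5 floor on d0 for very short targets, and the `relative_gain` helper are assumptions made for this example and are not taken from the paper; a real evaluation would also search for the superposition that maximizes the score.

```python
from typing import Sequence


def tm_d0(target_length: int) -> float:
    """Length-dependent distance scale d0 (in angstroms).

    The 0.5 floor for very short targets is an assumption of this sketch.
    """
    if target_length <= 15:
        return 0.5
    return max(0.5, 1.24 * (target_length - 15) ** (1.0 / 3.0) - 1.8)


def tm_score(distances: Sequence[float], target_length: int) -> float:
    """TM-score for one fixed superposition.

    `distances` holds the C-alpha distance (angstroms) of each aligned
    residue pair; unaligned target residues contribute nothing to the sum.
    """
    d0 = tm_d0(target_length)
    total = sum(1.0 / (1.0 + (d / d0) ** 2) for d in distances)
    return total / target_length


def relative_gain(score_a: float, score_b: float) -> float:
    """Percentage by which score_a exceeds score_b (hypothetical helper)."""
    return (score_a - score_b) / score_b * 100.0
```

Because the sum is divided by the full target length, an alignment that covers only half of the target with perfectly superposed residues scores 0.5, which is why both alignment coverage and alignment accuracy affect the averages quoted above.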
Related articles
Computational Intelligence
In this chapter, we present a brief overview of bioinformatics and computational intelligence (CI) methods including artificial neural networks, genetic algorithms, and fuzzy systems. A number of representative applications of CI methods in bioinformatics are discussed, including CI methods for gene expression analysis, for multiple sequence alignment, for protein-protein interaction prediction...
Comparison of the Lipophosphoglycan 3 Gene of the Lizard and Mammalian Leishmania: A Homology Modeling
Background: Lipophosphoglycan 3 (LPG3) is required for the assembly of LPG, a well-known virulence molecule. In this study, the LPG3 genes of the lizard and mammalian Leishmania species were cloned and sequenced. A three-dimensional (3D) structure for the target sequence was also predicted by comparative (homology) modeling. Materials and Methods: An optimization PCR amplification was performed o...
Protein Secondary Structure Prediction: a Literature Review with Focus on Machine Learning Approaches
A DNA sequence, although it contains all genetic traits, is not a functional entity. Instead, it is converted into protein sequences through transcription and translation. The protein sequence then folds into a 3D structure, which is the functional unit and can manage biological interactions using the information encoded in DNA. Every life process one can think of is undertaken by proteins with specific functio...
DINAMO: interactive protein alignment and model building
MOTIVATION: To facilitate the process of structure prediction by both comparative modeling and fold recognition, we describe DINAMO, an interactive protein alignment building and model evaluation tool that dynamically couples a multiple sequence alignment editor to a molecular graphics display. DINAMO allows the user to optimize the alignment and model to satisfy the known heuristics of protein ...
In Silico Analysis of Primary Sequence and Tertiary Structure of Lepidium Draba Peroxidase
Peroxidase enzymes are widely used in industry and diagnostics. Recently, we introduced a new peroxidase gene from Lepidium draba (LDP). According to protein multiple sequence alignment results, LDP had 93% similarity and 88.96% identity with horseradish peroxidase C1A (HRP C1A). In the current study, we employed in silico tools to determine to which group of peroxidase enzymes LDP...